Basic Statistics

Raw Counts

Name Value
Rows 12,600
Columns 50
Discrete columns 36
Continuous columns 14
All missing columns 0
Missing observations 36,452
Complete Rows 3,032
Total observations 630,000
Memory allocation 9.7 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 16 columns ignored with more than 50 categories.
## PWD_UPDT_TS: 9656 categories
## CARR_NAME: 541 categories
## STATE_PRVNC_TXT: 126 categories
## PH_NUM_UPDT_TS: 5885 categories
## CUST_SINCE_DT: 8031 categories
## TRAN_TS: 12185 categories
## TRAN_DT: 336 categories
## ACTVY_DT: 336 categories
## PWD_UPDT_TS.1: 9656 categories
## CARR_NAME.1: 541 categories
## STATE_PRVNC_TXT.1: 126 categories
## PH_NUM_UPDT_TS.1: 5885 categories
## CUST_SINCE_DT.1: 8031 categories
## TRAN_TS.1: 12185 categories
## TRAN_DT.1: 336 categories
## ACTVY_DT.1: 336 categories

QQ Plot

Correlation Analysis

## 18 features with more than 20 categories ignored!
## PWD_UPDT_TS: 3018 categories
## CARR_NAME: 327 categories
## STATE_PRVNC_TXT: 88 categories
## CUST_STATE: 46 categories
## PH_NUM_UPDT_TS: 2901 categories
## CUST_SINCE_DT: 2680 categories
## TRAN_TS: 3011 categories
## TRAN_DT: 260 categories
## ACTVY_DT: 260 categories
## PWD_UPDT_TS.1: 3018 categories
## CARR_NAME.1: 327 categories
## STATE_PRVNC_TXT.1: 88 categories
## CUST_STATE.1: 46 categories
## PH_NUM_UPDT_TS.1: 2901 categories
## CUST_SINCE_DT.1: 2680 categories
## TRAN_TS.1: 3011 categories
## TRAN_DT.1: 260 categories
## ACTVY_DT.1: 260 categories
## Warning in cor(x = structure(list(X = c(3L, 4L, 5L, 6L, 8L, 9L, 15L, 18L, : the standard deviation is zero

Principal Component Analysis

## 16 features with more than 50 categories ignored!
## PWD_UPDT_TS: 3018 categories
## CARR_NAME: 327 categories
## STATE_PRVNC_TXT: 88 categories
## PH_NUM_UPDT_TS: 2901 categories
## CUST_SINCE_DT: 2680 categories
## TRAN_TS: 3011 categories
## TRAN_DT: 260 categories
## ACTVY_DT: 260 categories
## PWD_UPDT_TS.1: 3018 categories
## CARR_NAME.1: 327 categories
## STATE_PRVNC_TXT.1: 88 categories
## PH_NUM_UPDT_TS.1: 2901 categories
## CUST_SINCE_DT.1: 2680 categories
## TRAN_TS.1: 3011 categories
## TRAN_DT.1: 260 categories
## ACTVY_DT.1: 260 categories
## Warning in plot_prcomp(data = structure(list(X = c(3L, 4L, 5L, 6L, 8L, 9L, : The following features are dropped due to zero variance:
##  * ACTN_CD_SCHPMT
##  * ACTN_INTNL_TXT_P2P_COMMIT
##  * TRAN_TYPE_CD_P2P
##  * ACTN_CD.1_SCHPMT
##  * ACTN_INTNL_TXT.1_P2P_COMMIT
##  * TRAN_TYPE_CD.1_P2P